Eliminating Non-Referring Noun Phrases from Coreference Resolution

Authors

  • Donna K. Byron
  • Whitney Gegg-Harrison
Abstract

Indefinite noun phrases in certain contexts are unable to support anaphoric coreference to an individual entity, and therefore should be ignored when searching for coreferent antecedents of anaphoric pronouns. However, many algorithms for anaphora resolution utilize noun phrase chunking or shallow parsing, and therefore do not make the needed distinctions to avoid this type of spurious antecedent. This study used simple syntactic criteria to remove indefinite phrases from consideration as antecedents to evaluate the effect of their removal on pronoun resolution. Pronoun resolution performance improved only marginally, revealing some interesting properties of current pronoun resolution algorithms.
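The filtering idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the actual syntactic criteria, so everything below is an assumption made for illustration: the `NounPhrase` type, the `in_nonreferring_context` flag (standing in for whatever contexts the study identified), and the determiner list are all hypothetical and are not the authors' method.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical determiner list; the abstract does not specify one.
INDEFINITE_DETERMINERS = {"a", "an", "some", "any", "no"}


@dataclass
class NounPhrase:
    text: str
    # Hypothetical flag: True when the phrase sits in a context assumed
    # to block reference to an individual entity. How such contexts are
    # detected is not given in the abstract.
    in_nonreferring_context: bool = False


def is_indefinite(np: NounPhrase) -> bool:
    """Return True if the phrase starts with an indefinite determiner."""
    tokens = np.text.lower().split()
    return bool(tokens) and tokens[0] in INDEFINITE_DETERMINERS


def filter_candidates(candidates: List[NounPhrase]) -> List[NounPhrase]:
    """Drop indefinite phrases flagged as non-referring; keep the rest."""
    return [
        np
        for np in candidates
        if not (is_indefinite(np) and np.in_nonreferring_context)
    ]


# Usage: only the flagged indefinite phrase is removed from the pool
# that a pronoun resolver would then search for antecedents.
candidates = [
    NounPhrase("a doctor", in_nonreferring_context=True),
    NounPhrase("the patient"),
    NounPhrase("a dog"),
]
kept = filter_candidates(candidates)
```

In this sketch an indefinite phrase is removed only when it is also flagged, which mirrors the abstract's wording that indefinites are non-referring "in certain contexts" rather than always.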

Similar resources

An “Incremental” Clustering Algorithm for Coreference Resolution

Coreference resolution, also known as the process of linking noun phrases (NPs) referring to the same real world entity mentioned in a document, is a difficult and important task in natural language processing. This paper introduces an “incremental” unsupervised coreference resolution algorithm that can make the most of the transitive property in a coreference chain as well as the dependencies ...

Corpus-Based Learning for Noun Phrase Coreference Resolution

In this paper, we present a learning approach for coreference resolution of noun phrases in unrestricted text. The approach learns from a small, annotated corpus and the task includes resolving not just pronouns but rather general noun phrases. In contrast to previous work, we attempt to evaluate our approach on a common data set, the MUC-6 coreference corpus. We obtained encouraging results, i...

Global Learning of Noun Phrase Anaphoricity in Coreference Resolution via Label Propagation

Knowledge of noun phrase anaphoricity might be profitably exploited in coreference resolution to bypass the resolution of non-anaphoric noun phrases. However, it is surprising to notice that recent attempts to incorporate automatically acquired anaphoricity information into coreference resolution have been somewhat disappointing. This paper employs a global learning method in determining the an...

Corpus-Based Identification of Non-Anaphoric Noun Phrases

Coreference resolution involves finding antecedents for anaphoric discourse entities, such as definite noun phrases. But many definite noun phrases are not anaphoric because their meaning can be understood from general world knowledge (e.g., "the White House" or "the news media"). We have developed a corpus-based algorithm for automatically identifying definite noun phrases that are non-anaphoric, w...

Corpus based coreference resolution for Farsi text

"Coreference resolution", or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreferent when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

Publication date: 2004